A NOVEL GROWTH FACTOR AND A GENETIC SEQUENCE ENCODING SAME

The present invention relates generally to an isolated molecule having vascular
endothelial growth factor-like properties and to a genetic sequence encoding same. The 
molecule will be useful in the development of a range of therapeutics and diagnostics 
useful in the treatment, prophylaxis and/or diagnosis of conditions requiring enhanced 
or diminished vasculature and/or vascular permeability. 

Bibliographic details of the publications referred to by author in this specification are 
collected at the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for 
the nucleotide and amino acid sequences referred to in the specification are defined 
following the bibliography. 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated element or integer or group of elements or integers but
not the exclusion of any other element or integer or group of elements or integers. 

Vascular endothelial growth factor (hereinafter referred to as "VEGF"), also known as
vasoactive permeability factor, is a secreted, covalently linked homodimeric glycoprotein
that specifically activates endothelial tissues (Senger et al., 1993). A range of functions
has been attributed to VEGF, such as its involvement in normal angiogenesis including
formation of the corpus luteum (Yan et al., 1993) and placental development (Sharkey
et al., 1993), regulation of vascular permeability (Senger et al., 1993), inflammatory
angiogenesis (Sunderkotter et al., 1994) and autotransplantation (Dissen et al., 1994) and
human diseases such as tumour promoting angiogenesis (Folkman & Shing, 1992),
rheumatoid arthritis (Koch et al., 1994) and diabetes related retinopathy (Folkman &
Shing, 1992).

VEGF is, therefore, an important molecule, making it a potentially valuable target for
research into therapeutics, prophylactics and diagnostic agents based on VEGF or its
activities. There is also a need to identify homologues or otherwise related molecules
for use as an alternative to VEGF or in conjunction with VEGF.

In work leading up to the present invention, the inventors sought the multiple endocrine 
neoplasia type I susceptibility gene (MEN1). Surprisingly, the inventors discovered that 
a genetic sequence excluded as a candidate for the MEN1 gene was nevertheless a new 
growth factor having some similarity to VEGF.

Accordingly, one aspect of the present invention comprises a biologically isolated 
proteinaceous molecule comprising a sequence of amino acids which: 

(i) is at least about 15% similar to the amino acid sequence set forth in SEQ ID
NO:2; and

(ii) is at least 5% dissimilar to the amino acid sequence set forth in SEQ ID NO:2. 
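
As a purely illustrative aid, the two-sided percentage criterion stated in (i) and (ii) can be sketched in Python. The function name and the treatment of dissimilarity as 100 minus the percentage similarity are assumptions made for this sketch only; the specification does not prescribe a particular comparison algorithm.

```python
def satisfies_similarity_window(similarity_pct,
                                min_similarity=15.0,
                                min_dissimilarity=5.0):
    """Check a candidate against both limbs of the criterion.

    similarity_pct: percentage similarity of the candidate to the reference
    sequence (SEQ ID NO:2 in the text), however it was measured.
    """
    if not 0.0 <= similarity_pct <= 100.0:
        raise ValueError("similarity must be a percentage between 0 and 100")
    # Assumption for this sketch: dissimilarity is the complement of similarity.
    dissimilarity_pct = 100.0 - similarity_pct
    return (similarity_pct >= min_similarity
            and dissimilarity_pct >= min_dissimilarity)
```

Under these assumptions, a candidate that is 33.3% similar to the reference satisfies both limbs, whereas a candidate that is 96% similar fails the dissimilarity limb and one that is 10% similar fails the similarity limb.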

Another aspect of the present invention provides a biologically isolated proteinaceous 
molecule having the following characteristics: 
(i) comprises an amino acid sequence having at least about 15% similarity but at
least about 5% dissimilarity to all or part of the amino acid sequence set forth
in SEQ ID NO:2;

(ii) exhibits at least one property in common with VEGF.

A related aspect of the present invention contemplates a biologically isolated
proteinaceous molecule having the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at 
least about 5% dissimilarity to the amino acid sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one of the following properties:

(a) ability to induce proliferation of vascular endothelial cells;

(b) ability to interact with flt-1/flk-1 family of receptors;

(c) ability to induce cell migration, cell survival and/or an increase in
intracellular levels of alkaline phosphatase.

By "biologically isolated" is meant that the molecule has undergone at least one step of 
purification from a biological source. Preferably, the molecule is also biologically pure,
meaning that a composition comprises at least about 20%, more preferably at least about
40%, still more preferably at least about 65%, even still more preferably at least about
80-90% or greater of the molecule as determined by weight, activity or other convenient
means, relative to other compounds in the composition. Most preferably, the molecule
is sequencably pure.

Another preferred aspect of the present invention provides the molecule in recombinant 
form. 

According to this aspect of the present invention, there is provided a recombinant
molecule comprising a sequence of amino acids which: 

(i) is at least about 15% similar to the amino acid sequence set forth in SEQ ID
NO:2; and

(ii) is at least 5% dissimilar to the amino acid sequence set forth in SEQ ID NO:2.

A related aspect of the present invention is directed to a recombinant molecule having 
the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at
least about 5% dissimilarity to all or part of the amino acid sequence set forth
in SEQ ID NO:2;

(ii) exhibits at least one property in common with VEGF.

A further related aspect of the present invention contemplates a recombinant molecule 
having the following characteristics: 
(i) comprises an amino acid sequence having at least about 15% similarity but at
least about 5% dissimilarity to the amino acid sequence set forth in SEQ ID
NO:2;

(ii) exhibits at least one of the following properties:

(a) ability to induce proliferation of vascular endothelial cells;

(b) ability to interact with flt-1/flk-1 family of receptors;

(c) ability to induce cell migration, cell survival and/or an increase in
intracellular levels of alkaline phosphatase.

The present invention also contemplates genomic or partial genome clones encoding a 
proteinaceous molecule having at least about 15% amino acid similarity but at least 
about 5% dissimilarity to SEQ ID NO:1.



The amino acid sequence set forth in SEQ ID NO:2 corresponds to human VEGF
(referred to herein as "VEGF165"). Accordingly, the molecule of the present invention
is VEGF-like or is a homologue of VEGF but comprises an amino acid sequence which
is similar but non-identical to the amino acid sequence of VEGF. Although the present
invention is exemplified using a human VEGF-like molecule, this is done with the
understanding that the instant invention contemplates the homologous molecule and
encoding sequence from other mammals such as livestock animals (e.g. sheep, pigs,
horses and cows), companion animals (e.g. dogs and cats) and laboratory test animals
(e.g. mice, rats, rabbits and guinea pigs) as well as non-mammals such as birds (e.g.
poultry birds), fish and reptiles. In a most preferred embodiment, the VEGF-like
molecule is of human origin and encoded by a gene located at chromosome 11q13. The
present invention extends, therefore, to the genomic sequence or part thereof encoding
the subject VEGF-like molecule.

Preferably, the percentage similarity is at least about 30%, more preferably at least about
40%, still more preferably at least about 50%, still even more preferably at least about 
60-70%, yet even more preferably at least about 80-95% to all or part of the amino acid 
sequence set forth in SEQ ID NO:2. 

In a particularly preferred embodiment, the VEGF-like molecule of the present invention
comprises a sequence of amino acids as set forth in SEQ ID NO:4 or is a part, fragment,
derivative or analogue thereof. The amino acid sequence set forth in SEQ ID NO:4 is
also referred to herein as "SOM175short". Particularly preferred similarities include
about 19-20%, and 29-30%. Reference herein to derivatives also includes splice
variants. Accordingly, the present invention extends to splice variants of SOM175short.
Examples of splice variants contemplated by the present invention include but are not
limited to variants with an amino acid sequence substantially as set forth in at least one
of SEQ ID NO:6, SEQ ID NO:8 and/or SEQ ID NO:10 or mutants or derivatives or
further splice variants thereof.

Another embodiment provides a recombinant molecule having the following
characteristics:

(i) an amino acid sequence substantially as set forth in SEQ ID NO:4 or having at
least about 15% similarity to all or part thereof provided that said amino acid
sequence is at least about 5% dissimilar to all or part of the amino acid sequence
set forth in SEQ ID NO:2;

(ii) exhibits at least one biological property in common with VEGF.

Another embodiment provides a recombinant molecule having the following
characteristics:

(i) an amino acid sequence substantially as set forth in SEQ ID NO:6 or having at
least about 15% similarity to all or part thereof provided that said amino acid
sequence is at least about 5% dissimilar to all or part of the amino acid sequence
set forth in SEQ ID NO:2;

(ii) exhibits at least one biological property in common with VEGF.

Another embodiment provides a recombinant molecule having the following
characteristics:

(i) an amino acid sequence substantially as set forth in SEQ ID NO:8 or having at
least about 15% similarity to all or part thereof provided that said amino acid
sequence is at least about 5% dissimilar to all or part of the amino acid sequence
set forth in SEQ ID NO:2;

(ii) exhibits at least one biological property in common with VEGF.

Another embodiment provides a recombinant molecule having the following
characteristics:

(i) an amino acid sequence substantially as set forth in SEQ ID NO:10 or having at
least about 15% similarity to all or part thereof provided that said amino acid
sequence is at least about 5% dissimilar to all or part of the amino acid sequence
set forth in SEQ ID NO:2;

(ii) exhibits at least one biological property in common with VEGF.

Such properties of VEGF include at least one of: 
(a) ability to induce proliferation of vascular endothelial cells;

(b) an ability to interact with flt-1/flk-1 family of receptors;

(c) an ability to induce cell migration, cell survival and/or an increase in intracellular 
levels of alkaline phosphatase. 

In accordance with the present invention, a preferred similarity is at least about 40%,
more preferably at least about 50% and even more preferably at least about 65% 
similarity. 

Still a further aspect of the present invention contemplates a peptide fragment
corresponding to a portion of the amino acid sequence set forth in SEQ ID NO:4 or a
splice variant thereof such as set forth in SEQ ID NO:6, SEQ ID NO:8 or SEQ ID
NO:10 or a chemical equivalent thereof. The biologically isolated or recombinant
molecule of the present invention may be naturally glycosylated or may comprise an
altered glycosylation pattern depending on the cells from which it is isolated or
synthesised. For example, if produced by recombinant means in prokaryotic organisms,
the molecule would be non-glycosylated. The molecule may be a full length, naturally
occurring form or may be a truncated or otherwise derivatised form.

Yet another aspect of the present invention is directed to a nucleic acid molecule
encoding the VEGF-like molecule herein described. More particularly, the present
invention provides a nucleic acid molecule comprising a sequence of nucleotides
substantially as set forth in SEQ ID NO:3 or having at least 15% similarity to all or part
thereof or being capable of hybridising under low stringency conditions to a reverse
complement of the nucleotide sequence as set forth in SEQ ID NO:3 provided that the
nucleic acid sequence has at least 15% similarity but at least 30% dissimilarity to the
nucleotide sequence as set forth in SEQ ID NO:3. The nucleotide sequence set forth in
SEQ ID NO:3 is also referred to herein as "SOM175". Preferably, the percentage
dissimilarity is about 35%, more preferably about 39% and even more preferably about
40-50% or greater.

For the purposes of defining the level of stringency, reference can conveniently be made
to Sambrook et al (1989) at pages 9.47-9.51, which is herein incorporated by reference,
where the washing steps disclosed are considered high stringency. A low stringency is
defined herein as being in 4-6X SSC/0.1-0.5% w/v SDS at 37-45°C for 2-3 hours.
Depending on the source and concentration of nucleic acid involved in the hybridisation,
alternative conditions of stringency may be employed such as medium stringent
conditions which are considered herein to be 1-4X SSC/0.25-0.5% w/v SDS at >45°C
for 2-3 hours or high stringent conditions considered herein to be 0.1-1X SSC/0.1% w/v
SDS at 60°C for 1-3 hours.
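
For illustration only, the three wash conditions defined above can be tabulated in Python and used to label a proposed wash. The dictionary layout and the function name are hypothetical, and ">45°C" is read literally as strictly above 45°C with no stated upper limit; neither assumption forms part of the definitions themselves.

```python
def _within(value, low, high):
    """Inclusive range check."""
    return low <= value <= high


# Ranges transcribed from the definitions in the text:
# SSC in multiples of 1X, SDS in % w/v, temperature in degrees C, time in hours.
STRINGENCY = {
    "low": {
        "ssc": (4.0, 6.0), "sds": (0.1, 0.5), "hours": (2.0, 3.0),
        "temp_ok": lambda t: 37.0 <= t <= 45.0,
    },
    "medium": {
        "ssc": (1.0, 4.0), "sds": (0.25, 0.5), "hours": (2.0, 3.0),
        "temp_ok": lambda t: t > 45.0,  # assumption: strict, no upper bound
    },
    "high": {
        "ssc": (0.1, 1.0), "sds": (0.1, 0.1), "hours": (1.0, 3.0),
        "temp_ok": lambda t: t == 60.0,
    },
}


def matching_stringencies(ssc_x, sds_pct, temp_c, hours):
    """Return the names of every defined condition that the wash satisfies."""
    return [
        name
        for name, cond in STRINGENCY.items()
        if _within(ssc_x, *cond["ssc"])
        and _within(sds_pct, *cond["sds"])
        and _within(hours, *cond["hours"])
        and cond["temp_ok"](temp_c)
    ]
```

A list is returned rather than a single label because the stated ranges share boundary values (for example 4X SSC appears in both the low and medium definitions), and a wash outside all three definitions yields an empty list.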

The present invention further contemplates a nucleic acid molecule which encodes a 
VEGF-like molecule as hereinbefore described having at least 15% nucleotide sequence
homology to SEQ ID NO:3. Preferred levels of homology include at least about 40%, 
more preferably around 60-70%. 

The VEGF-like molecule of the present invention will be useful in the development of
a range of therapeutic and/or diagnostic applications alone or in combination with other
molecules such as VEGF. The present invention extends, therefore, to pharmaceutical
compositions comprising the VEGF-like molecule or parts, fragments, derivatives,
homologues or analogues thereof together with one or more pharmaceutically acceptable
carriers and/or diluents. Furthermore, the present invention extends to vectors
comprising the nucleic acid sequence set forth in SEQ ID NO:3 or having at least about
15%, more preferably about 40% and even more preferably around 60-70% similarity
thereto but at least 30% and more preferably around 39% dissimilarity thereto and host
cells comprising same. In addition, the present invention extends to ribozymes and
antisense molecules based on SEQ ID NO:3 as well as neutralizing antibodies to the
VEGF-like molecule. Such molecules may be useful in ameliorating the effects of, for
example, over expression of VEGF-like genes leading to angiogenesis or vascularization
of tumours.

The present invention also contemplates antibodies to the VEGF-like molecule or nucleic 
acid probes to a gene encoding the VEGF-like molecule which are useful as diagnostic 
agents. 

The present invention is further described by reference to the following non-limiting 
Figures and/or Examples. 

In the Figures:

Figure 1 Nucleotide sequence [SEQ ID NO:1] and corresponding amino acid
sequence [SEQ ID NO:2] of VEGF165.

Figure 2 Nucleotide sequence [SEQ ID NO:3] and corresponding amino acid
sequence [SEQ ID NO:4] of SOM175.

Figure 3 Results of BLAST search with SOM175 protein sequence.

Figure 4 BESTFIT alignment of VEGF cDNA and SOM175 cDNA.

Figure 5 Multiple alignment of VEGF165 with SOM175 and its splice variants at
the nucleotide level.

Figure 6 Multiple alignment of VEGF165 with SOM175 and its splice variants at
the amino acid level.

Figure 7 Diagrammatic representation of SOM175 and its splice variants.

Figure 8(a) Diagrammatic representation of genomic structure of human SOM175
showing exon/intron map.

Figure 8(b) Diagrammatic representation of genomic structure of human SOM175
showing exon/intron boundaries.

TABLE 1

SUMMARY OF SEQUENCE IDENTITY NUMBERS

SEQ ID NO:1     Nucleotide sequence of VEGF165
SEQ ID NO:2     Amino acid sequence of VEGF165
SEQ ID NO:3     Nucleotide sequence of SOM175 (VEGF-like molecules)
SEQ ID NO:4     Amino acid sequence of SOM175
SEQ ID NO:5     Nucleotide sequence of SOM175 absent exon 6
SEQ ID NO:6     Amino acid sequence of SOM175 absent exon 6
SEQ ID NO:7     Nucleotide sequence of SOM175 absent exon 6 and exon 7
SEQ ID NO:8     Amino acid sequence of SOM175 absent exon 6 and exon 7
SEQ ID NO:9     Nucleotide sequence of SOM175 absent exon 4
SEQ ID NO:10    Amino acid sequence of SOM175 absent exon 4

EXAMPLE 1

Human cDNA clones

The original SOM175 cDNA was isolated by screening a human foetal brain library
(λZAPII, Stratagene) with the cosmid D11S750 (Larsson et al, 1992) using methods
previously described (Xu et al, 1992). The plasmid was excised "in vivo" and a single
1.1 kb cDNA was obtained. Three independent SOM175 cDNA clones were also
isolated from a human foetal spleen library (Uni-ZAP, Stratagene) using the
above-mentioned SOM175 insert as a probe. Three clones were obtained: SOM175-4A, -5A
and -6A. SOM175-5A is an alternately spliced clone with exon 4 being absent
(SOM175-e4). These library screens were performed using hybridisation conditions
recommended by the manufacturer of the library (Stratagene) and random primed insert
of SOM175.

Two partial human SOM175 cDNAs have also been isolated from a λGT11 human melanoma
cell line A2058 library (Clontech). cDNA library screens were performed using
hybridisation conditions described by Church and Gilbert (1984). In each case, the
probe was generated by random priming of a PCR product derived from SOM175
(18f-700r).

Mouse cDNA Clones 

Human SOM175 was also used to screen a mouse neonatal whole brain cDNA library 
(Unizap, Stratagene). Four non-chimeric clones were isolated: M175-A, B, C, D. All 
clones were partial cDNAs and M175-C contained several introns. Three of these 
cDNAs lacked exon 6.

Another clone, referred to as M1, was completely sequenced and was found to contain
the full open reading frame plus part of the 5' UTR and the entire 3' UTR.

EXAMPLE 2

DNA SEQUENCE ANALYSIS

The entire sequence of the cDNA clone (SOM175) was compiled and is shown in
Figure 2 with its corresponding amino acid sequence. This sequence was screened for
open reading frames using the MAP program (GCG, University of Wisconsin). A single
open reading frame of 672bp was observed (see Figure 2). There appears to be little 5'
untranslated sequence (2bp). The 3' untranslated region appears to be complete as it
includes a poly-adenylation signal and poly-A tail.
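
The MAP program itself is not reproduced here. As a minimal sketch of the kind of open reading frame screen described above, the following Python function scans the three forward reading frames for the longest stretch running from an ATG to an in-frame stop codon. The function name and the toy input used to exercise it are hypothetical, and only the forward strand is considered.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(dna):
    """Return (start, end, orf) for the longest ATG-to-stop ORF.

    Positions are 0-based, `end` is exclusive and the stop codon is included
    in `orf`. Whitespace in the input is ignored. If no complete ORF is
    found, (0, 0, "") is returned.
    """
    seq = "".join(dna.upper().split())
    best = (0, 0, "")
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first ATG opens a candidate ORF in this frame
            elif start is not None and codon in STOP_CODONS:
                end = i + 3
                if end - start > len(best[2]):
                    best = (start, end, seq[start:end])
                start = None  # look for the next ATG downstream
    return best
```

For a toy sequence such as "CCATGAAATTTTGACC", the function reports an ORF beginning at 0-based position 2, which corresponds to a two-base 5' leader before the initiating ATG.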

Database homology searches were performed using the BLAST algorithm (run at NCBI,
USA). This analysis revealed homology to several mammalian forms of VEGF (see
Figure 3). The amount of homology between SOM175 and human VEGF165 was
determined using the BESTFIT program (GCG, University of Wisconsin; see Figures
4 and 5). Nucleotide homology was estimated at 69.7% and protein homology was
estimated as at least 33.3% identity and 52.5% conservation using BESTFIT analysis.
BLAST analysis on nucleotide sequences revealed an almost complete match to a
human expressed sequence tag EST06302 (Adams et al., 1993).

These data indicate that SOM175 encodes a growth factor that has structural similarities
to VEGF. Both genes show start and stop codons in similar positions and share discrete
blocks of homology. All 8 cysteines as well as a number of other VEGF residues
believed to be involved in dimerisation are conserved. These residues are Cysteine-47,
Proline-70, Cysteine-72, Valine-74, Arginine-77, Cysteine-78, Glycine-80, Cysteine-81,
Cysteine-82, Cysteine-89, Proline-91, Cysteine-122 and Cysteine-124 and are shown in
Figure 6. Given the structural conservation between VEGF and the SOM175 gene
product, it is also possible that they share functional similarities. It is proposed that
SOM175 encodes a VEGF-like molecule that shares some properties with VEGF but has
unique properties of its own. The nucleotide sequence and corresponding amino acid
sequence of VEGF165 are shown in Figure 1.

EXAMPLE 3

The percentage similarity and divergence between the VEGF165 family and the SOM175
family (protein) were analysed using the Clustal method, MegAlign Software, DNASTAR,
Wisconsin. The results are shown in Tables 2.1 and 2.2. The alternatively spliced
forms of SOM175 are abbreviated to SOM175-e6 where all of exon 6 is deleted;
SOM175-e6&7 where all of exons 6 and 7 are deleted; and SOM175-e4 where all
of exon 4 is deleted. The spliced forms of SOM175 are shown in Figure 7. Genomic
maps of SOM175 showing intron/exon boundaries are shown in Figures 8a and 8b.
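
By way of illustration, a pairwise comparison of pre-aligned sequences can be sketched in Python as below. This is not the Clustal method as implemented in MegAlign, which computes its own similarity and divergence scores; the sketch simply counts identical aligned positions, and the function names and toy sequences are hypothetical.

```python
def percent_identity(a, b):
    """Percent of aligned columns holding the same residue in both sequences.

    `a` and `b` must be rows of the same alignment (equal length, '-' for a
    gap). Columns gapped in both rows are ignored; a column gapped in only
    one row counts as a mismatch.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(a, b) if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def identity_matrix(aligned):
    """Upper-triangular pairwise identity table, keyed by (name_i, name_j)."""
    names = list(aligned)
    return {
        (p, q): round(percent_identity(aligned[p], aligned[q]), 1)
        for i, p in enumerate(names)
        for q in names[i + 1:]
    }
```

Only the upper triangle is produced, mirroring the layout of a pairwise comparison table. Divergence as reported by alignment software is generally not the simple complement of percent identity, so a sketch of this kind should not be expected to reproduce published similarity and divergence values.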

Table 2.1

A Percent nucleotide similarity between splice variants of SOM175 and
human VEGF165

              VEGF165      SOM175       SOM175-e6    SOM175-e6&7  SOM175-e4
VEGF165       ***          34.9                      41.4         37.0
SOM175                     ***          98.9         95.1         99.2
SOM175-e6                               ***          98.8         84.0
SOM175-e6&7                                          ***          80.3
SOM175-e4                                                         ***

B Percent nucleotide divergence between splice variants of SOM175 and
human VEGF165

              VEGF165      SOM175       SOM175-e6    SOM175-e6&7  SOM175-e4
VEGF165       ***          41.7         41.6         41.7         41.8
SOM175                     ***          0.2          0.2          0.0
SOM175-e6                               ***          0.0          0.2
SOM175-e6&7                                          ***          0.3
SOM175-e4                                                         ***



Table 2.2

A Percent amino acid identity between splice variants of SOM175 and
human VEGF165

              VEGF165      SOM175       SOM175-e6    SOM175-e6&7  SOM175-e4
VEGF165       ***          31.4         42.3         33.5         40.6
SOM175                     ***          74.7         73.7         99.1
SOM175-e6                               ***          76.8         99.1
SOM175-e6&7                                          ***          99.1
SOM175-e4                                                         ***

B Percent amino acid divergence between splice variants of SOM175 and
human VEGF165

              VEGF165      SOM175       SOM175-e6    SOM175-e6&7  SOM175-e4
VEGF165       ***          65.7         55.4         54.6         57.4
SOM175                     ***          19.9         4.2          0.0
SOM175-e6                               ***          0.0          0.0
SOM175-e6&7                                          ***          0.0
SOM175-e4                                                         ***

EXAMPLE 4

BIOASSAYS TO DETERMINE THE FUNCTION OF SOM175

Assays are conducted to evaluate whether SOM175 has similar activities to VEGF on
endothelial cell function, angiogenesis and wound healing. Other assays are
performed based on the results of receptor binding distribution studies.

Assays of endothelial cell function

Endothelial cell proliferation. Endothelial cell growth assays are performed as
described in Ferrara & Henzel (1989) and in Gospodarowicz et al (1989).

Vascular permeability assay. This assay, which utilises the Miles test in guinea pigs,
will be performed as described in Miles & Miles (1952).

Cell adhesion assay. The influence of SOM175 on adhesion of polymorphs to
endothelial cells is analysed.

Chemotaxis. This is performed using the standard Boyden chamber chemotaxis
assay.

Plasminogen activator assay. Endothelial cells are tested for plasminogen activator
and plasminogen activator inhibitor production upon addition of SOM175 (Pepper et
al (1991)).

Endothelial cell migration assay. The ability of SOM175 to stimulate endothelial 
cells to migrate and form tubes is assayed as described in Montesano et al (1986). 

Angiogenesis Assay 

SOM175 induction of an angiogenic response in chick chorioallantoic membrane is 
evaluated as described in Leung et al (1989). 

Possible neurotrophic actions of SOM175 are assessed using the following assays: 

Neurite outgrowth assay and gene induction (PC12 cells) 

PC12 cells (a phaeochromocytoma cell line) respond to NGF and other neurotrophic
factors by developing the characteristics of sympathetic neurons, including the
induction of early and late genes and the extension of neurites. These cells are
exposed to SOM175 and their response monitored (Drinkwater et al (1991) and
Drinkwater et al (1993)).

Cultured neurons from the Peripheral Nervous System (PNS) 

Primary cultures of the following PNS neurons are exposed to SOM175 and
monitored for any response:
- sensory neurons from neural crest and dorsal root ganglia
- sympathetic neurons from sympathetic chain ganglia
- placode derived sensory neurons from nodose ganglia
- motoneurons from spinal cord

The assays are described in Suter et al (1992) and in Marinou et al (1992). 

Where an in vitro response is observed, in vivo assays for properties such as uptake
and retrograde transport are performed as described in Hendry et al (1992).

Nerve regeneration (PNS)

Where neurotrophic effects of SOM175 are observed, its possible role in the
regeneration of axotomised sensory neurons, sympathetic neurons and motoneurons is
analysed by the methods of Otto et al (1989); Yip et al (1984) and Hendry et al
(1976).

Actions of SOM175 on CNS neurons 

The ability of SOM175 to promote survival of central nervous system neurons is 
analysed as described in Hagg et al (1992); Williams et al (1986); Hefti (1986) and 
Kromer (1987).

Wound Healing 

The ability of SOM175 to support wound healing is tested in the most clinically
relevant model available, as described in Schilling et al (1959) and utilised by Hunt
et al (1967).

The Haemopoietic System

A variety of in vitro and in vivo assays on specific cell populations of the
haemopoietic system are available and are outlined below:

Stem Cells
Murine

A variety of novel in vitro murine stem cell assays have been developed using
FACS-purified cells:

(a) Repopulating Stem Cells

These are cells capable of repopulating the bone marrow of lethally irradiated mice,
and have the Lin⁻, Rhʰⁱ, Ly-6A/E⁺, c-kit⁺ phenotype. The test substance is tested on
these cells either alone, or by co-incubation with multiple factors, followed by
measurement of cellular proliferation by ³H thymidine incorporation.

(b) Late Stage Stem Cells

These are cells that have comparatively little bone marrow repopulating ability but
can generate D13 CFU-S. These cells have the Lin⁻, Rhʰⁱ, Ly-6A/E⁺, c-kit⁺
phenotype. The test substance is incubated with these cells for a period of time,
injected into lethally irradiated recipients, and the number of D13 spleen colonies
enumerated.

(c) Progenitor-Enriched Cells

These are cells that respond in vitro to single growth factors, and have the Lin⁻, Rhʰⁱ,
Ly-6A/E⁺, c-kit⁺ phenotype. This assay will show if SOM175 can act directly on
haemopoietic progenitor cells. The test substance is incubated with these cells in
agar cultures, and the number of colonies enumerated after 7-14 days.

Atherosclerosis

Smooth muscle cells play a crucial role in the development or initiation of
atherosclerosis, requiring a change in their phenotype from a contractile to a synthetic
state. Macrophages, endothelial cells, T lymphocytes and platelets all play a role in
the development of atherosclerotic plaques by influencing the growth and phenotypic
modulations of smooth muscle cells. An in vitro assay that measures the proliferative
rate and phenotypic modulations of smooth muscle cells in a multicellular
environment is used to assess the effect of SOM175 on smooth muscle cells. The
system uses a modified Rose chamber in which different cell types are seeded onto
opposite coverslips.

Effects of SOM175 on bone 

The ability of SOM175 to regulate proliferation of osteoblasts is assayed as described 
in Lowe et al (1991). Any effects on bone resorption are assayed as described in 
Lowe et al (1991). Effects on osteoblast migration and changes in intracellular 
molecules (e.g. cAMP accumulation, alkaline phosphatase levels) are analysed as
described in Midy et al (1994). 

Effects on skeletal muscle cells

Effects of SOM175 on proliferation of myoblasts and development of myotubes can
be determined as described by Ewton et al (1980) and by Gospodarowicz et al
(1976).

Those skilled in the art will appreciate that the invention described herein is 
susceptible to variations and modifications other than those specifically described. It 
is to be understood that the invention includes all such variations and modifications. 
The invention also includes all of the steps, features, compositions and compounds 
referred to or indicated in this specification, individually or collectively, and any and
all combinations of any two or more of said steps or features. 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 649 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 17..589

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TCGGGCCTCC GAAACC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC  49
                  Met Asn Phe Leu Leu Ser Trp Val His Trp Ser
                    1               5                  10

CTT GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA  97
Leu Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala
             15                  20                  25

CCC ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC  145
Pro Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe
         30                  35                  40

ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG  193
Met Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val
     45                  50                  55

GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA  241
Asp Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro
 60                  65                  70                  75

TCC TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC  289
Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly
                 80                  85                  90

CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG  337
Leu Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met
             95                 100                 105

CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA  385
Arg Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu
        110                 115                 120

CAG CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA  433
Gln His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln
    125                 130                 135

GAA AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA  481
Glu Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val
140                 145                 150                 155

CAA GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT  529
Gln Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg
                160                 165                 170

TGC AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC  577
Cys Lys Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp
            175                 180                 185

AAG CCG AGG CGG TGAGCCGGGC AGGAGGAAGG AGCCTCCCTC AGCGTTTCGG  629
Lys Pro Arg Arg
        190

GAACCAGATC TCTCACCAGG  649



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 
<B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala Leu Leu Leu 
1 5 10 15 

Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Met Ala Glu Gly 
20 25 30 

Gly Gly Gln Asn His His Glu Val Val Lys Phe Met Asp Val Tyr Gln 
35 40 45 

Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp Ile Phe Gln Glu 
50 55 60 

Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser Cys Val Pro Leu 
65 70 75 80 

Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu Glu Cys Val Pro 
85 90 95 

Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg Ile Lys Pro His 
100 105 110 

Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln His Asn Lys Cys 
115 120 125 

Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu Asn Pro Cys Gly 
130 135 140 

Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln Asp Pro Gln Thr 
145 150 155 160 

Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gln 
165 170 175 

Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg 
180 185 190 
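
The correspondence between the codon rows of SEQ ID NO:1 and the residues of SEQ ID NO:2 can be checked mechanically. The following is a minimal sketch only, assuming the standard genetic code applies; the function name `translate` and the lookup table are illustrative and form no part of the specification.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
# Assumption: the specification does not name a code table, so the
# standard one is used here for illustration.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence to one-letter residues ('*' marks a stop)."""
    seq = cds.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Last four codons of the coding region of SEQ ID NO:1 plus the stop codon.
print(translate("AAG CCG AGG CGG TGA"))  # KPRR*  (Lys Pro Arg Arg, stop)
```

The same call can be applied row by row to any codon line of the listing to confirm the residue line printed beneath it.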



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..624 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 47 
Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gln 
1 5 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gln Ala Pro Val Ser Gln Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gln Arg Lys Val Val Ser Trp Ile Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gln Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 239 
Val Ala Lys Gln Leu Val Pro Ser Cys Val Thr Val Gln Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 287 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gln His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 335 
Gln Val Arg Met Gln Ile Leu Met Ile Arg Tyr Pro Ser Ser Gln Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 383 
Gly Glu Met Ser Leu Glu Glu His Ser Gln Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAC AGG GCT GCC ACT CCC CAC CAC 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Arg Ala Ala Thr Pro His His 
130 135 140 

CGT CCC CAG CCC CGT TCT GTT CCG GGC TGG GAC TCT GCC CCC GGA GCA 479 
Arg Pro Gln Pro Arg Ser Val Pro Gly Trp Asp Ser Ala Pro Gly Ala 
145 150 155 

CCC TCC CCA GCT GAC ATC ACC CAT CCC ACT CCA GCC CCA GGC CCC TCT 527 
Pro Ser Pro Ala Asp Ile Thr His Pro Thr Pro Ala Pro Gly Pro Ser 
160 165 170 175 

GCC CAC GCT GCA CCC AGC ACC ACC AGC GCC CTG ACC CCC GGA CCT GCC 575 
Ala His Ala Ala Pro Ser Thr Thr Ser Ala Leu Thr Pro Gly Pro Ala 
180 185 190 

GCT GCC GCT GCC GAC GCC GCA GCT TCC TCC GTT GCC AAG GGC GGG GCT T 624 
Ala Ala Ala Ala Asp Ala Ala Ala Ser Ser Val Ala Lys Gly Gly Ala 
195 200 205 

AGAGCTCAAC CCAGACACCT GCAGGTGCCG GAAGCTGCGA AGGTGACACA TGGCTTTTCA 684 

GACTCAGCAG GGTGACTTGC CTCAGAGGCT ATATCCCAGT GGGGGAACAA AGGGGAGCCT 744 

GGTAAAAAAC AGCCAAGCCC CCAAGACCTC AGCCCAGGCA GAAGCTGCTC TAGGACCTGG 804 

GCCTCTCAGA GGGCTCTTCT GCCATCCCTT GTCTCCCTGA GGCCATCATC AAACAGGACA 864 

GAGTTGGAAG AGGAGACTGG GAGGCAGCAA GAGGGGTCAC ATACCAGCTC AGGGGAGAAT 924 

GGAGTACTGT CTCAGTTTCT AACCACTCTG TGCAAGTAAG CATCTTACAA CTGGCTCTTC 984 

CTCCCCTCAC TAAGAAGACC CAAACCTCTG CATAATGGGA TTTGGGCTTT GGTACAAGAA 1044 

CTGTGACCCC CAACCCTGAT AAAAGAGATG GAAGGAAAAA AAAAAAAAAA 1094 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gln Leu 
1 5 10 15 

Ala Pro Ala Gln Ala Pro Val Ser Gln Pro Asp Ala Pro Gly His Gln 
20 25 30 

Arg Lys Val Val Ser Trp Ile Asp Val Tyr Thr Arg Ala Thr Cys Gln 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gln Leu Val Pro Ser Cys Val Thr Val Gln Arg Cys Gly Gly 
65 70 75 80 

Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gln His Gln 
85 90 95 

Val Arg Met Gln Ile Leu Met Ile Arg Tyr Pro Ser Ser Gln Leu Gly 
100 105 110 

Glu Met Ser Leu Glu Glu His Ser Gln Cys Glu Cys Arg Pro Lys Lys 
115 120 125 

Lys Asp Ser Ala Val Lys Pro Asp Arg Ala Ala Thr Pro His His Arg 
130 135 140 

Pro Gln Pro Arg Ser Val Pro Gly Trp Asp Ser Ala Pro Gly Ala Pro 
145 150 155 160 

Ser Pro Ala Asp Ile Thr His Pro Thr Pro Ala Pro Gly Pro Ser Ala 
165 170 175 

His Ala Ala Pro Ser Thr Thr Ser Ala Leu Thr Pro Gly Pro Ala Ala 
180 185 190 

Ala Ala Ala Asp Ala Ala Ala Ser Ser Val Ala Lys Gly Gly Ala 
195 200 205 






(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..566 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 47 
Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gln 
1 5 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gln Ala Pro Val Ser Gln Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gln Arg Lys Val Val Ser Trp Ile Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gln Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 239 
Val Ala Lys Gln Leu Val Pro Ser Cys Val Thr Val Gln Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 287 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gln His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 335 
Gln Val Arg Met Gln Ile Leu Met Ile Arg Tyr Pro Ser Ser Gln Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 383 
Gly Glu Met Ser Leu Glu Glu His Ser Gln Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAT AGC CCC AGG CCC CTC TGC CCA 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Ser Pro Arg Pro Leu Cys Pro 
130 135 140 






CGC TGC ACC CAG CAC CAC CAG CGC CCT GAC CCC CGG ACC TGC CGC TGC 479 
Arg Cys Thr Gln His His Gln Arg Pro Asp Pro Arg Thr Cys Arg Cys 
145 150 155 

CGC TGC CGA CGC CGC AGC TTC CTC CGT TGC CAA GGG CGG GGC TTA GAG 527 
Arg Cys Arg Arg Arg Ser Phe Leu Arg Cys Gln Gly Arg Gly Leu Glu 
160 165 170 175 

CTC AAC CCA GAC ACC TGC AGG TGC CGG AAG CTG CGA AGG TGACACATGG 576 
Leu Asn Pro Asp Thr Cys Arg Cys Arg Lys Leu Arg Arg 
180 185 

CTTTTCAGAC TCAGCAGGGT GACTTGCCTC AGAGGCTATA TCCCAGTGGG GGAACAAAGG 636 

GGAGCCTGGT AAAAAACAGC CAAGCCCCCA AGACCTCAGC CCAGGCAGAA GCTGCTCTAG 696 

GACCTGGGCC TCTCAGAGGG CTCTTCTGCC ATCCCTTGTC TCCCTGAGGC CATCATCAAA 756 

CAGGACAGAG TTGGAAGAGG AGACTGGGAG GCAGCAAGAG GGGTCACATA CCAGCTCAGG 816 

GGAGAATGGA GTACTGTCTC AGTTTCTAAC CACTCTGTGC AAGTAAGCAT CTTACAACTG 876 

GCTCTTCCTC CCCTCACTAA GAAGACCCAA ACCTCTGCAT AATGGGATTT GGGCTTTGGT 936 

ACAAGAACTG TGACCCCCAA CCCTGATAAA AGAGATGGAA GGAAAAAAAA AAAAAAA 993 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gln Leu 
1 5 10 15 

Ala Pro Ala Gln Ala Pro Val Ser Gln Pro Asp Ala Pro Gly His Gln 
20 25 30 

Arg Lys Val Val Ser Trp Ile Asp Val Tyr Thr Arg Ala Thr Cys Gln 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gln Leu Val Pro Ser Cys Val Thr Val Gln Arg Cys Gly Gly 
65 70 75 80 

Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gln His Gln 
85 90 95 

Val Arg Met Gln Ile Leu Met Ile Arg Tyr Pro Ser Ser Gln Leu Gly 
100 105 110 

Glu Met Ser Leu Glu Glu His Ser Gln Cys Glu Cys Arg Pro Lys Lys 
115 120 125 

Lys Asp Ser Ala Val Lys Pro Asp Ser Pro Arg Pro Leu Cys Pro Arg 
130 135 140 

Cys Thr Gln His His Gln Arg Pro Asp Pro Arg Thr Cys Arg Cys Arg 
145 150 155 160 

Cys Arg Arg Arg Ser Phe Leu Arg Cys Gln Gly Arg Gly Leu Glu Leu 
165 170 175 

Asn Pro Asp Thr Cys Arg Cys Arg Lys Leu Arg Arg 
180 185 
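
The CDS locations given for the nucleotide sequences can be cross-checked against the stated lengths of the corresponding protein sequences. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS span that excludes the stop codon; the function name `codons_in_cds` is illustrative and not part of the listing.

```python
def codons_in_cds(start: int, end: int) -> tuple:
    """Return (complete codons, leftover bases) for a 1-based inclusive CDS span."""
    length = end - start + 1
    return divmod(length, 3)


# CDS locations and protein lengths as stated in the sequence listing.
listing = {
    "SEQ ID NO:3 -> NO:4": ((3, 624), 207),
    "SEQ ID NO:5 -> NO:6": ((3, 566), 188),
    "SEQ ID NO:7 -> NO:8": ((3, 431), 143),
    "SEQ ID NO:9 -> NO:10": ((3, 305), 101),
}

for name, ((start, end), stated_length) in listing.items():
    codons, leftover = codons_in_cds(start, end)
    print(name, codons, leftover, codons == stated_length)
```

Under these assumptions each span yields the stated number of residues; the span 3..624 leaves one base over, matching the single trailing nucleotide shown at position 624 of SEQ ID NO:3.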



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 47 
Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gln 
1 5 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gln Ala Pro Val Ser Gln Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gln Arg Lys Val Val Ser Trp Ile Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gln Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 239 
Val Ala Lys Gln Leu Val Pro Ser Cys Val Thr Val Gln Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 287 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gln His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 335 
Gln Val Arg Met Gln Ile Leu Met Ile Arg Tyr Pro Ser Ser Gln Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 383 
Gly Glu Met Ser Leu Glu Glu His Ser Gln Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAT AGG TGC CGG AAG CTG CGA AGG 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Arg Cys Arg Lys Leu Arg Arg 
130 135 140 

TGACACATGG CTTTTCAGAC TCAGCAGGGT GACTTGCCTC AGAGGCTATA TCCCAGTGGG 491 

GGAACAAAGG GGAGCCTGGT AAAAAACAGC CAAGCCCCCA AGACCTCAGC CCAGGCAGAA 551 

GCTGCTCTAG GACCTGGGCC TCTCAGAGGG CTCTTCTGCC ATCCCTTGTC TCCCTGAGGC 611 

CATCATCAAA CAGGACAGAG TTGGAAGAGG AGACTGGGAG GCAGCAAGAG GGGTCACATA 671 

CCAGCTCAGG GGAGAATGGA GTACTGTCTC AGTTTCTAAC CACTCTGTGC AAGTAAGCAT 731 

CTTACAACTG GCTCTTCCTC CCCTCACTAA GAAGACCCAA ACCTCTGCAT AATGGGATTT 791 

GGGCTTTGGT ACAAGAACTG TGACCCCCAA CCCTGATAAA AGAGATGGAA GGAAAAAAAA 851 

AAAAAAA 858 






(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gln Leu 
1 5 10 15 

Ala Pro Ala Gln Ala Pro Val Ser Gln Pro Asp Ala Pro Gly His Gln 
20 25 30 

Arg Lys Val Val Ser Trp Ile Asp Val Tyr Thr Arg Ala Thr Cys Gln 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gln Leu Val Pro Ser Cys Val Thr Val Gln Arg Cys Gly Gly 
65 70 75 80 

Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gln His Gln 
85 90 95 

Val Arg Met Gln Ile Leu Met Ile Arg Tyr Pro Ser Ser Gln Leu Gly 
100 105 110 

Glu Met Ser Leu Glu Glu His Ser Gln Cys Glu Cys Arg Pro Lys Lys 
115 120 125 

Lys Asp Ser Ala Val Lys Pro Asp Arg Cys Arg Lys Leu Arg Arg 
130 135 140 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 47 
Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gln 
1 5 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gln Ala Pro Val Ser Gln Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gln Arg Lys Val Val Ser Trp Ile Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 



Gln Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 



191 



GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 239 
Val Ala Lys Gln Leu Val Pro Ser Cys Val Thr Val Gln Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 287 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gln His 
80 85 90 95 

CAA GTC CGG ATG CAG ACC TAAAAAAAAG GACAGTGCTG TGAAGCCAGA 335 
Gln Val Arg Met Gln Thr 
100 

CAGGGCTGCC ACTCCCCACC ACCGTCCCCA GCCCCGTTCT GTTCCGGGCT GGGACTCTGC 395 

CCCCGGAGCA CCCTCCCCAG CTGACATCAC CCATCCCACT CCAGCCCCAG GCCCCTCTGC 455 

CCACGCTGCA CCCAGCACCA CCAGCGCCCT GACCCCCGGA CCTGCCGCTG CCGCTGCCGA 515 

CGCCGCAGCT TCCTCCGTTG CCAAGGGCGG GGCTTAGAGC TCAACCCAGA CACCTGCAGG 575 

TGCCGGAAGC TGCGAAGGTG ACACATGGCT TTTCAGACTC AGCAGGGTGA CTTGCCTCAG 635 

AGGCTATATC CCAGTGGGGA ACAAAGAGGA GCCTGGTAAA AAACAGCCAA GCCCCCAAGA 695 

CCTCAGCCCA GGCAGAAGCT GCTCTAGGAC CTGGGCCTCT CAGAGGGCTC TTCTGCCATC 755 

CCTTGTCTCC CTGAGGCCAT CATCAAACAG GACAGAGTTG GAAGAGGAGA CTGGGAGGCA 815 

GCAAGAGGGG TCACATACCA GCTCAGGGGA GAATGGAGTA CTGTCTCAGT TTCTAACCAC 875 

TCTGTGCAAG TAAGCATCTT ACAACTGGCT CTTCC 910 

(2) INFORMATION FOR SEQ ID NO : 10 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gln Leu 
1 5 10 15 

Ala Pro Ala Gln Ala Pro Val Ser Gln Pro Asp Ala Pro Gly His Gln 
20 25 30 

Arg Lys Val Val Ser Trp Ile Asp Val Tyr Thr Arg Ala Thr Cys Gln 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gln Leu Val Pro Ser Cys Val Thr Val Gln Arg Cys Gly Gly 
65 70 75 80 

Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gln His Gln 
85 90 95 

Val Arg Met Gln Thr 
100 

FIGURE 1 

TCGGGCCTCC GAAACC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC 49 
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser 
1 5 10 

CTT GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA 97 
Leu Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala 
15 20 25 

CCC ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC 145 
Pro Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe 
30 35 40 

ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG 193 
Met Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val 
45 50 55 

GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA 241 
Asp Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro 
60 65 70 75 

TCC TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC 289 
Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly 
80 85 90 

CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG 337 
Leu Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met 
95 100 105 

CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA 385 
Arg Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu 
110 115 120 

CAG CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA 433 
Gln His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln 
125 130 135 

GAA AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA 481 
Glu Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val 
140 145 150 155 

CAA GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT 529 
Gln Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg 
160 165 170 

TGC AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC 577 
Cys Lys Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp 
175 180 185 

AAG CCG AGG CGG TGAGCCGGGC AGGAGGAAGG AGCCTCCCTC AGCGTTTCGG 629 
Lys Pro Arg Arg 
190 

GAACCAGATC TCTCACCAGG 649 



FIGURE 2 

CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 47 
Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gln 
1 5 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gln Ala Pro Val Ser Gln Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gln Arg Lys Val Val Ser Trp Ile Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gln Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 239 
Val Ala Lys Gln Leu Val Pro Ser Cys Val Thr Val Gln Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 287 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gln His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 335 
Gln Val Arg Met Gln Ile Leu Met Ile Arg Tyr Pro Ser Ser Gln Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 383 
Gly Glu Met Ser Leu Glu Glu His Ser Gln Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAC AGG GCT GCC ACT CCC CAC CAC 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Arg Ala Ala Thr Pro His His 
130 135 140 

CGT CCC CAG CCC CGT TCT GTT CCG GGC TGG GAC TCT GCC CCC GGA GCA 479 
Arg Pro Gln Pro Arg Ser Val Pro Gly Trp Asp Ser Ala Pro Gly Ala 
145 150 155 

CCC TCC CCA GCT GAC ATC ACC CAT CCC ACT CCA GCC CCA GGC CCC TCT 527 
Pro Ser Pro Ala Asp Ile Thr His Pro Thr Pro Ala Pro Gly Pro Ser 
160 165 170 175 

GCC CAC GCT GCA CCC AGC ACC ACC AGC GCC CTG ACC CCC GGA CCT GCC 575 
Ala His Ala Ala Pro Ser Thr Thr Ser Ala Leu Thr Pro Gly Pro Ala 
180 185 190 

GCT GCC GCT GCC GAC GCC GCA GCT TCC TCC GTT GCC AAG GGC GGG GCT T 624 
Ala Ala Ala Ala Asp Ala Ala Ala Ser Ser Val Ala Lys Gly Gly Ala 
195 200 205 

FIGURE 2 (continued...) 

AGAGCTCAAC CCAGACACCT GCAGGTGCCG GAAGCTGCGA AGGTGACACA TGGCTTTTCA 684 

GACTCAGCAG GGTGACTTGC CTCAGAGGCT ATATCCCAGT GGGGGAACAA AGGGGAGCCT 744 

GGTAAAAAAC AGCCAAGCCC CCAAGACCTC AGCCCAGGCA GAAGCTGCTC TAGGACCTGG 804 

GCCTCTCAGA GGGCTCTTCT GCCATCCCTT GTCTCCCTGA GGCCATCATC AAACAGGACA 864 

GAGTTGGAAG AGGAGACTGG GAGGCAGCAA GAGGGGTCAC ATACCAGCTC AGGGGAGAAT 924 

GGAGTACTGT CTCAGTTTCT AACCACTCTG TGCAAGTAAG CATCTTACAA CTGGCTCTTC 984 

CTCCCCTCAC TAAGAAGACC CAAACCTCTG CATAATGGGA TTTGGGCTTT GGTACAAGAA 1044 

CTGTGACCCC CAACCCTGAT AAAAGAGATG GAAGGAAAAA AAAAAAAAAA 1094 



FIGURE 3 

>VEGF_HUMAN VEGF_HUMAN VASCULAR ENDOTHELIAL GROWTH FACTOR PRECURSOR (VEGF) 
(VASCULAR 215 AA. 
Length = 215 

Score = 181 (92.4 bits), Expect = 6.4e-20, P = 6.4e-20 
Identities = 33/75 (44%), Positives = 48/75 (64%) 

Query: 31 HQRKVVSWIDVYTRATCQPREVVVPLTVELMGTVAKQLVPSCVTVQRCGGCCPDDGLECV 90 
          +++ VV  +DVY R+ C+P E +V +  E    +     PSCV + RCGGCC D+GLECV 
Sbjct: 36 NHHEVVKFMDVYQRSYCHPIETLVDIFQEYPDEIEYIFKPSCVPLMRCGGCCNDEGLECV 95 

Query: 91 PTGQHQVRMQILMIR 105 
          PT +  + MQI+ I+ 
Sbjct: 96 PTEESNITMQIMRIK 110 

Score = 76 (38.3 bits), Expect = 0.0011, Poisson P(2) = 9.1e-12 
Identities = 12/19 (63%), Positives = 16/19 (84%) 

Query: 110 QLGEMSLEEHSQCECRPKK 128 
           ++GEMS  +H+ CECRPKK 
Sbjct: 116 HIGEMSFLQHNKCECRPKK 134 

Score = 72 (36.8 bits), Expect = 0.0046, Poisson P(3) = 3.6e-18 
Identities = 14/21 (66%), Positives = 15/21 (71%) 

Query: 202 RCQGRGLELNPDTCRCRKLRR 222 
           RC +R LELN  TCRC K RR 
Sbjct: 195 RCKARQLELNERTCRCDKPRR 215 

Score = 46 (23.5 bits), Expect = 47., Poisson P(4) = 7.3e-10 
Identities = 6/10 (60%), Positives = 9/10 (90%) 

Query: 187 DPRTCRCRCR 196 
           DP+TC+C C+ 
Sbjct: 181 DPQTCKCSCK 190 
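
The "Identities" figures reported for each segment can be recomputed directly from the aligned Query and Sbjct strings. The following is a minimal sketch, assuming ungapped segments of equal length as shown in this figure; the function name `identities` is illustrative and is not part of the search program's output.

```python
def identities(query: str, subject: str) -> tuple:
    """Count identical positions in two aligned, equal-length segments."""
    if len(query) != len(subject):
        raise ValueError("aligned segments must have equal length")
    matches = sum(q == s for q, s in zip(query, subject))
    return matches, len(query)


# Fourth segment of the figure (Query 187-196 against Sbjct 181-190).
matches, length = identities("DPRTCRCRCR", "DPQTCKCSCK")
print(f"Identities = {matches}/{length} ({100 * matches // length}%)")
# Identities = 6/10 (60%)
```

The "Positives" figures additionally depend on the scoring matrix used for the search, which the figure does not state, so they are not recomputed here.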



FIGURE 4 

Gap Weight: 3.000 Average Match: 1.000 
Length Weight: 0.100 Average Mismatch: -0.900 
Quality: 100.9 Length: 739 

Ratio: 0.175 Gaps: 30 

Percent Similarity: 69.703 Percent Identity: 69.703 

[Pairwise nucleotide alignment of SOM175 (upper line, beginning at position 28) with VEGF (lower line, beginning at position 17). The alignment itself is illegible in this copy.] 


FIGURE 5 

165SOMSQ.MSF.msf MSF: 687 Type: D Tuesday, June 20, 1995 Check: 3140 

1 80 
VEGF165      ATGAACTTTCTGCTGTCTTGGGTGCATTGGAGCCTTGCCTTGCTGCTCTACCTCCACCATGCCAAGTGGTCCCAGGCTG. 
SOM175       ATGAGCCCTCTGCTCCGCCGCCTGCTGCTCGCCGCACTCCTGCAGCTGGCCCCCGCCCAGGCCCCTGTCTCCCAGCCTGA 
SOM175-e6    ATGAGCCCTCTGCTCCGCCGCCTGCTGCTCGCCGCACTCCTGCAGCTGGCCCCCGCCCAGGCCCCTGTCTCCCAGCCTGA 
SOM175-e6&7  ATGAGCCCTCTGCTCCGCCGCCTGCTGCTCGCCGCACTCCTGCAGCTGGCCCCCGCCCAGGCCCCTGTCTCCCAGCCTGA 
SOM175-e4    ATGAGCCCTCTGCTCCGCCGCCTGCTGCTCGCCGCACTCCTGCAGCTGGCCCCCGCCCAGGCCCCTGTCTCCCAGCCTGA 

81 160 
VEGF165      CACCCATGGCAGAAGGAGGAGGGCAGAATCATCACGAAGTGGTGAAGTTCATGGATGTCTATCAGCGCAGCTACTGCCAT 
SOM175       TGCCCCTGGCCACCAGAGGAAAGTGGTGTCATGGATAGATGTGTATACTCGCG CTACCTGC . CAGCC . CCGGGAG 
SOM175-e6    TGCCCCTGGCCACCAGAGGAAAGTGGTGTCATGGATAGATGTGTATACTCGCG CTACCTGC . CAGCC . CCGGGAG 
SOM175-e6&7  TGCCCCTGGCCACCAGAGGAAAGTGGTGTCATGGATAGATGTGTATACTCGCG CTACCTGC . CAGCC . CCGGGAG 
SOM175-e4    TGCCCCTGGCCACCAGAGGAAAGTGGTGTCATGGATAGATGTGTATACTCGCG . . . CTACCTGC . CAGCC . CCGGGAG 

161 240 
VEGF165      CCAATCGAGACCCTGGTGGACATCTTCCAGGAGTACCCTGATGAGATCGAGTACATCTTCAAGCCATCCTGTGTGCCCCT 
SOM175       GTGGTGGTGCCCTTGACTG . TGGAGCTCATGGG CACCGTGGCCAAAC . .AGCTGGTGCCCAG CTGCGTGACTGT 
SOM175-e6    GTGGTGGTGCCCTTGACTG . TGGAGCTCATGGG CACCGTGGCCAAAC . .AGCTGGTGCCCAG CTGCGTGACTGT 
SOM175-e6&7  GTGGTGGTGCCCTTGACTG . TGGAGCTCATGGG CACCGTGGCCAAAC . .AGCTGGTGCCCAG CTGCGTGACTGT 
SOM175-e4    GTGGTGGTGCCCTTGACTG . TGGAGCTCATGGG CACCGTGGCCAAAC . .AGCTGGTGCCCAG CTGCGTGACTGT 

241 320 
VEGF165      GATGCGATGCGGGGGCTGCTGCAATGACGAGGGCCTGGAGTGTGTGCCCACTGAGGAGTCCAACATCACCATGCAGATTA 
SOM175       GCAGCGCTGTGGTGGCTGCTGCCCTGACGATGGCCTGGAGTGTGTGCCCACTGGGCAGCACCAAGTCCGGATGCAGATCC 
SOM175-e6    GCAGCGCTGTGGTGGCTGCTGCCCTGACGATGGCCTGGAGTGTGTGCCCACTGGGCAGCACCAAGTCCGGATGCAGATCC 
SOM175-e6&7  GCAGCGCTGTGGTGGCTGCTGCCCTGACGATGGCCTGGAGTGTGTGCCCACTGGGCAGCACCAAGTCCGGATGCAGATCC 
SOM175-e4    GCAGCGCTGTGGTGGCTGCTGCCCTGACGATGGCCTGGAGTGTGTGCCCACTGGGCAGCACCAAGTCCGGATGCAGA... 

321 400 
VEGF165      TGCGGATCAAACCTCACCAAGGCCAGCACATAGGAGAGATGAGCTTCCTACAGCACAACAAATGTGAATGCAGACC...A 
SOM175       TCATGATCCGG...TACCCGAGCAGTCAGCTGGGGGAGATGTCCCTGGAAGAACACAGCCAGTGTGAATGCAGACCTAAA 
SOM175-e6    TCATGATCCGG...TACCCGAGCAGTCAGCTGGGGGAGATGTCCCTGGAAGAACACAGCCAGTGTGAATGCAGACCTAAA 
SOM175-e6&7  TCATGATCCGG...TACCCGAGCAGTCAGCTGGGGGAGATGTCCCTGGAAGAACACAGCCAGTGTGAATGCAGACCTAAA 
SOM175-e4                                                                              CCTAAA 

401 480 
VEGF165      AAGAAAGATAG AG CAAG ACAAG AA .... AAT C CCTGTGG GCCTTGCTCAGAGCGGAGA 
SOM175       AAAAAGGACAGTGCTGTGAAGCCAGACAGGGCTGCCACTCCCCACCACCGTCCCCAGCCCCGTTCTGTTCCGGGCTGGGA 
SOM175-e6    AAAAAGGACAGTGCTGTGAAGCCAGATAG 
SOM175-e6&7  AAAAAGGACAGTGCTGTGAAGCCAGATAG 
SOM175-e4    AAAAAGGACAGTGCTGTGAAGCCAGACAGGGCTGCCACTCCCCACCACCGTCCCCAGCCCCGTTCTGTTCCGGGCTGGGA 

481 560 
VEGF165      AAGCA TTTGTT TGTAC . .A 
SOM175       CTCTGCCCCCGGAGCACCCTCCCCAGCTGACATCACCCATCCCACTCCAGCCCCAGGCCCCTCTGCCCACGCTGCACCCA 
SOM175-e6                                                      CCCCAGGCCCCTCTGCCCACGCTGCACCCA 
SOM175-e6&7  
SOM175-e4    CTCTGCCCCCGGAGCACCCTCCCCAGCTGACATCACCCATCCCACTCCAGCCCCAGGCCCCTCTGCCCACGCTGCACCCA 

561 640 
VEGF165      A GATCCGCAGACGTGTAAATGTTCCTGCAAAAAC . ACAGACTCG . . CGTTG CAAGG CG AG G CAG C 
SOM175       GCACCACCAGCGCCCTGACCCCCGGACCTGCCGCTGCCGCTGCCGACGCCGCAGCTTCCTCCGTTGCCAAGGGCGGGGCT 
SOM175-e6    GCACCACCAGCGCCCTGACCCCCGGACCTGCCGCTGCCGCTGCCGACGCCGCAGCTTCCTCCGTTGCCAAGGGCGGGGCT 
SOM175-e6&7  
SOM175-e4    GCACCACCAGCGCCCTGACCCCCGGACCTGCCGCTGCCGCTGCCGACGCCGCAGCTTCCTCCGTTGCCAAGGGCGGGGCT 

641 687 
VEGF165      TTGAGTTAAACGAACGTACTTGCAGATGTGACAAGCCGAGGCGGTGA 
SOM175       TAGAGCTCAACCCAGACACCTGCAGGTGCCGGAAGCTGCGAAGGTGA 
SOM175-e6    TAGAGCTCAACCCAGACACCTGCAGGTGCCGGAAGCTGCGAAGGTGA 
SOM175-e6&7                           GTGCCGGAAGCTGCGAAGGTGA 
SOM175-e4    TAGAGCTCAACCCAGACACCTGCAGGTGCCGGAAGCTGCGAAGGTGA 



FIGURE 6 



[Amino acid sequence alignment of VEGF165 with SOM175 (short form), followed by the alternative alignment of VEGF165 with SOM175 (long form). The alignment itself is illegible in this copy.] 


Areas of 100% homology are boxed and conserved residues thought to be involved in homodimerisation are 
underlined. The VEGF sequence depicted includes the 26 amino acid leader sequence (removal of which 
gives rise to mature VEGF165) giving a total length of 191 amino acids. 

Homology of SOM175 to VEGF165 is 27% (33%) at the protein level; however, within this are blocks of 
100% homology. In particular, many structural residues are conserved, including those thought to be 
involved in homodimerisation of VEGF (by comparison with PDGF), 
i.e. Cysteine-47 

Proline-70, Cysteine-72, Valine-74 

Arginine-77, Cysteine-78, Glycine-80, Cysteines-81 & 82 
Cysteine-89, Proline-91 
Cysteines 122 & 124 



FIGURE 7 




FIGURE 8A 







FIGURE 8B 



5'UTR...     ATGAGG  *Exon 1 (60bp)   GGCCAG  gtacgtgagg 
tctcccacag   GCCCCT   Exon 2 (43bp)   GGAAAG  aatacttaca 
tctgctccca   TGGTGT   Exon 3 (187bp)  ATGCAG  gtccgagctg 
ctgaatacag   ATCCTC   Exon 4 (73bp)   ATGCAG  gtgtcaggca 
acttttcaag   ACCTAA   Exon 5 (34bp)   AGACAG  gtgagtcttt 
ctcctccgta   GGCTGC   Exon 6 (101bp)  CTCCAG  ccccaggccc 
cccactccag   CCCCAG   Exon 7 (109bp)  ACCCAG  acacctgtag 
ccctgctcag   GTGCCG  *Exon 8 (22bp)   AGGTGA  ...3'UTR 